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We give a one-dimensional quantum cellular automaton (QCA) capable of simulating all others. 
By this we mean that the initial configuration and the local transition rule of any one-dimensional 
QCA can be encoded within the initial configuration of the universal QCA. Several steps of the 
universal QCA will then correspond to one step of the simulated QCA. The simulation preserves 
the topology in the sense that each cell of the simulated QCA is encoded as a group of adjacent 
cells in the universal QCA. The encoding is linear and hence does not carry any of the cost of the 
computation. We do this in two flavours: a weak one which requires an infinite but periodic initial 
configuration and a strong one which needs only a finite initial configuration. 
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I. INTRODUCTION 

In this section we give a quick overview of QCA and explain why they matter. We then move on to explain why 
the notion of intrinsic simulation is so important in the CA community, we then list some related construction and 
provide an outline for the paper. 

A. Quantum cellular automata 

One-dimensional cellular automata (CA) consists of a line of cells, each of which may take one in a finite number of 
possible states. These evolve in discrete time steps according to a local rule, applied synchronously and homogeneously 
across space. Because they are a physics-like model of computation it seems very natural to study their quantum 
extensions. The flourishing research in quantum information and quantum computer science provides us with ap- 
propriate context for doing so, both in terms of the theoretical framework and potential applications. Indeed this 
field has already brought to light a number of theoretical results about one-dimensional quantum cellular automata 
(QCA). For instance Schumacher and Werner [22 have proved that any local, translation-invariant unitary evolution 
over a line of finite-dimensional quantum systems takes the form of a pattern of more elementary unitary evolutions - 
repeated across time and space. And right from the very birth of the field with Feynman's 1986 paper, it was hoped 
that QCA may prove an important path to realistic implementations of quantum computers [H] - mainly because 
they eliminate the need for an external, classical control and hence the principal source of decoherence. Other possi- 
ble aims include providing models of distributed quantum computation, providing bridges between computer science 
notions and modern theoretical physics, or understanding the dynamics of some quantum physical system in discrete 
spacetime, i.e. from an idealized viewpoint. As we shall see in section [TT] of the paper, any QCA can be put into the 
form given by Figure [T] For our purpose, we will take this particular space and time tiling of a scattering unitary U 
as the actual definition of QCA. 

B. Intrinsic universality 

The most popular cellular automaton is Conway's 'Game of Life', a two-dimensional CA which has been proven 
to be universal for computation - in the sense that any algorithm can be encoded within its initial state and then be 
run by the cellular automaton's evolution. This was accomplished by simulating any Turing Machine (TM) within 
the automaton, and since Turing Machines have long been regarded as pretty much the best definition of 'what an 
algorithm is' in classical computer science, this could have meant the end of the story to many people. Yet researchers 
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FIG. 1: Partitioned one-dimensional QCA with scattering unitary U. Each line represents a quantum system, in this case a 
whole cell. Each square represents a scattering unitary U which gets applied upon two cells. Time flows upwards. 



in CA have always been looking for more than just running any algorithm, seeking to run distributed algorithms in 
a distributed manner, model some other phenomena together with their spatial structure, or make use of the spatial 
parallelism which is inherent to the model as these arc the features which are modelled by CA and not by TM. 
And hence they have had to come back [T] to the original meaning of the word 'universal', namely the ability for 
one instance of a computational model to be able to simulate all other instances of the very same computational 
model. Nowadays there is an impressive number results about intrinsically universal CA as reviewed for instance 
in [SI [T7] - i.e. results on cellular automata capable of simulating all others efficiently and directly. (Incidentally 
of course they also simulate those CA which are capable of simulating the TM.) Or to put things differently, most 
of the CA community now seems to consider that a good notion of simulation is one which preserves the topology 
and the parallelism of the simulated CA, in some simple and explicit fashion. In the same manner, studying QCA 
rather than QTM for instance means we bother about the spatial structure of things, whether for the purpose of 
describing a quantum protocol, modelling a quantum physical phenomena, or again taking into account the spatial 
parallelism inherent to the model. Hence we argue that the kind of universality we are looking for is in fact stronger 
than the ability to simulate any quantum Turing Machine. We seek an intrinsically universal QCA, i.e. a QCA 
which can simulate all others efficiently and directly. A good intuition of the notion of intrinsic simulation is given 
by the diagram in Figure [2] but formal definitions follow in section [TTJ In particular, we will distinguish two notions 
of intrinsic universality depending upon whether we allow preparation of an infinite periodic initial configuration or 
insist to have only finite configurations. 



C. Related results 



In the realm of classical computing Durand-Lose |10j . has described an intrinsically universal one-dimensional 
reversible cellular automaton. Our construction will turn out to be a little simpler and cannot be substituted for 
this previous one, because reversible circuit universality requires at least one 3-bit gate if done without the help of 
quantum mechanics. In the realm of quantum computing Shepherd, Franz and Werner |23j have defined a class of 
QCA where the scattering unitary Vi changes at each step i (CCQCA). Via this construct they have built a QCA of 
cell-dimension 12 which is universal in the circuit-sense. Universality in the circuit-sense had already been achieved by 
Van Dam [27] and Raussendorf [21] - the latter uses a two-dimensional QCA but has this inspiring idea of programs 
crossing the data, with computation occurring in the interaction. Watrous |28] has proved that QCA are universal 
in the sense of Quantum Turing Machines. To our knowledge there is no previous work on intrinsically universal 
quantum cellular automata. 



D. Plan 



In section [TT] the reader is provided with the necessary theoretical background on QCA - and the notion of intrinsic 
simulation is transposed to this theory. As in the classical case intrinsic simulation comes in two flavours, with one 
stronger than the other. In section [IIT] we construct a particular QCA, which we show is intrinsically universal, in the 
first sense. In section [TV] we augment this QCA, and show that this results in a particular QCA which is intrinsically 
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FIG. 2: Intrinsic simulation of a QCA by another. (The QCA defined by U simulates the QCA defined by V. In this case we 
need two cells of the 17-QCA in order to encode one cell of the V-QCA, and we need to run the [/-QCA for four time steps 
in order to simulate one time step of the V-QCA. More generally the challenge is to come up with an initial configuration of 
the (7-QCA so that it behaves just as the V-QCA with respect to the encoded initial configuration, after some fixed number of 
time steps. Clearly such an encoding will have to hold the configuration of the V-QCA as well as some way of describing the 
scattering unitary V .) 



universal, in the second stronger sense. We conclude in section W\ 



II. THEORETICAL BACKGROUND 
A. One-dimensional QCA 

We now recall the fundamental definitions and properties of one-dimensional QCA. In what follows E is a fixed finite 
set of symbols (i.e. 'the alphabet', describing the possible basic states each cell may take) and q is a symbol such that 
q £ E, which will be known as 'the quiescent symbol', which represents an empty cells. We write q + E = {q} U E for 
short. 

Definition 1 (Finite configurations) A (finite) configuration c over q + E is a function c : Z — > q + E, with 
i i — > c(i) = Ci, such that there exists a (possibly empty) finite interval I verifying i € I Cj € g+E and i ^ / =>■ Cj = q. 
The smallest such interval I is called interval domain of c, and is denoted idom(c). The set of all finite configurations 
over q + E will be denoted C g +s ; whilst the set of all finite configurations having interval domain included in a finite 
interval J will be denoted C^ +s . 

Whilst configurations hold the basic states of an entire line of cells, and hence denote the possible basic states of the 
entire QCA, the global state of a QCA may well turn out to be a superposition of these. The following definition 



works because C„+e is a countably infinite set. 



Definition 2 (Superpositions of configurations) Let IHe 



be the Hilbert space of configurations, defined as 



follows. To each finite configuration c is associated a unit vector \c) , such that the family (|c)) c 



orthonormal 



basis of JC e , +E 
subspace of !Ke 9+E 



A superposition of configurations is then a unit vector in ?{e 9+s . 



We also denote by JCg 



the 



spanned by the configurations in Q J 
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Note that this space of QCA configurations is the same one as in J2J [3] [T2] [T3J [25] . It is isomorphic to the cyclic one 
considered in [15] , but fundamentally different from the finite, bounded periodic space of [27] and the infinite setting 
of [22]. The infinite setting of [22] is slightly more permissive, but would force us to abandon the traditional setting 
of HUbert spaces and move towards C*— algebras. We choose not to do so out of simplicity, but our results apply to 
that context also since QCA have the same form in both contexts, as explained in [3J. 

Definition 3 (Unitarity) A linear operator G : JCe +E — ► 3-Ce 5+s is unitary if and only if {G\c) \ c £ C 9 +s} * s an 
orthonormal basis of "Kp , „ . 

Definition 4 (Shift-invariance) Consider the shift operation which takes configuration 

c = . . . c^iCiCi + i . . . to c! = . . . c' i _ 1 c l i c l i+1 . . . where for all i c[ — Cj+i. Let a : 3is q+s — > ^e q+s be its linear extension 
to superpositions of configurations. A linear operator G : !Ke +s — ► !Ke +s is said to be shift invariant if and only if 
Go = oG. 

Definition 5 (Causality) A linear operator G : JCe +s — ► +s is said to be causal with with radius ~ if and 
only if for any p, p' two states over Jfe, , and for any i 6Z, we have 

p\i,i + i = p'\i,i + i => GpG^\i = Gp'G^\i. (1) 

Here we used the notation p\ j to mean restriction of p to the region J in the sense of the partial trace, and G^ is the 
Hermitian adjoint of G. In the classical case, the definition would be that the letter to be read in some given cell i 
at time t + 1 depends only on the state of the cells i and i + 1 at time t. Transposed to a quantum setting, we get 
the above definition: to know the state of cell number i, we only need to know the states of cells i and i + 1 before 
the evolution. 

We are now ready to give the formal definition of one-dimensional quantum cellular automata. 

Definition 6 (QCA) A one-dimensional quantum cellular automaton (QCA) is an operator 
G : J£e ?+S — ► 3fe„ +E which is unitary, shift-invariant and causal. 

This is clearly the natural axiomatic quantization of the notion of cellular automata. It was first given in [3J but stems 
from equivalent definition in the literature, phrased in terms of homomorphism of a C*-algebra [22]. There are other 
definitions in [2j |7j [T2J [13] [15] [27 28 which are not axiomatic, in the sense that they all make particular assumptions 
about the form of the local action of G, and G is then defined as a composition of these actions. That of [7] turns out 
to be almost equivalent to ours [3] [22] in the end, as explained in 0]. 

B. Intrinsic simulation of one-dimensional QCA 

The notions of intrinsic simulation of one CA by another arises with Banks in [5], but is not formalized until 
Albert and Culik pp. Even then, this apparently simple concept gives rise to two different competing definitions, 
which we may call intrinsic simulations and strong intrinsic simulations respectively. Both are nicely explained in [5J , 
but subsequent works tend to focus on weak intrinsic simulations, as is nicely reviewed in [TT], [T5] . Hence let us start 
with an explanation of the more modern notion of intrinsic simulation. 

The basic intuition in order to say that G' simulates G is that we can translate the content of each cell of G into cells 
of G', run G' , and then reverse the translation - and that this three steps process will be equivalent to just running 
G. But first we must make it clear what we mean by 'translate'. This translation should be simple (the idea is that 
the real cost of the computation is carried over by G"), it should preserve the topology (the idea is that each cell of 
G is encoded into cells of G' in a way which preserves whom neighbours whom) , and it should be faithful (the idea is 
that no information should be lost in translation) . This latter requirement translates into a precise notion in quantum 
theory, which is that of being an isometry, i.e. an inner product preserving evolution with EnS Enc = I. This same 
requirement also coincides with the fact that we would like this translation to be a physical process, i.e. that an actual 
translating machine can actually be built in theory. This is because quantum mechanics limits all physical evolutions 
to be isometries. With these observations in mind we reach the following definitions. 

Definition 7 (Isometric coding) Consider q + £ and q" + S" two alphabets with distinguished quiescent states q 
and q" , and such that \q + Y,\ < \q" + Consider < H q +Y, and ^K q "+^" the HUbert spaces having these alphabets as 
their basis, and JCe 5+S7 3{.e „ +s „ the HUbert spaces of finite configurations over these alphabets. 

Let E be isometric linear map from 3~C g +£ to r K q "+Y." which preserves quiescence, i.e. such that E\q) = \q"). It 
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FIG. 3: The notion of intrinsic simulation made formal. 

trivially extends into an isometric linear map Enc = from 3ie 9+E into %e h +s h> which we refer to as an 

isometric encoding. 

Let D be isometric linear map from 3~Cg"+£" to "Kq+T, ® 3~Cj"+£" which also preserves quiescence, in the sense 
that D\q") — \q) ® W')- It trivially extends into an isometric linear map Dec — ((^) z D) from JCe „ +E „ into 
3^e, +E ®^e,» +E //i which we refer to as an isometric decoding. 

The isometries E and D define an isometric coding if the following condition is satisfied: 

V|V> eJf e , tE , 3\</>) e5fe 5 „ +E „ / \^)^\cf > ) = Dec(Enc\^)). 

(The understanding here is that Dec is morally inverse function of Enc, but we may leave out some garbage \<f>) in 
the way.) 

Definition 8 (Direct simulation) Consider q + £ and q" + S" two alphabets with distinguished quiescent states q 
and q" , and two QCA G and G" over these alphabets. We say that G" directly simulates G, if and only if there exists 
an isometric coding such that 

VzeN, V|V) e Xe q+S , 3\<t>) G M e<l „ +S „ / (G»)®|0) = ^ec(G" l ( J Bnc|V))). 

Unfortunately this is not quite enough for intrinsic simulation. Often we want to say that G' simulates G even though 
the translation: 

- takes one cell of G into several, not just one cell of G"; 

- hence the quiescent symbol q becomes a quiescent word q'; 

- G' needs be run t times instead of just once. 

All of these changes are made formal via the notion of grouping, as in the following definitions. 

Definition 9 ( Grouping) Let G' be a QCA over alphabet </+£'. Let s and t be two integers, q" a word in (g' + S') s , 
and E" = Consider the iterate global evolution G up to a grouping of each s adjacent cells into one supercell. 

Lf this operator can be considered to be a QCA G" over q" + then we say that G" is an (s,t,q')~ grouping of G' . 

Definition 10 (Intrinsic simulation) Consider q + Y. and q' + £' two alphabets with distinguished quiescent states 
q and q' , and two QCA G and G' over these alphabets. We say that G' intrinsically simulates G if and only if there 
exists G" some grouping of G' such that G" directly simulates G. 

In simple words G' intrinsically simulates G if and only if there exists some isometry E which translates each cell 
of G into s cells of G', such that if we then run G n and translate back, the whole process is just equivalent to just a 
run of G. This understanding is captured by Fig. [3] Notice, however, that q the quiescent symbol of G gets encoded 
into words \q') = E\q), which in general may not be the same as \q' s ), i.e. s quiescent cells of G'. Hence with this 
notion of weak intrinsic simulation, we are indirectly assuming that the initial state of simulating QCA G' could be 
prepared in a non-finite configuration, i.e. one which does not end and begin with only q' symbols, but repeated 
q' words instead. Formally this is not a problem, since G' the (s, t, <7')-grouping of G' remains a valid QCA with 
quiescent symbol q'. Yet, depending on the application, one may wonder whether this notion of intrinsic simulation 
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is the appropriate notion. For instance, if the implementation of G' cannot be fed with q' words left and right as the 
computation unravels, and if we do not know when the computation is supposed to stop, then this notion of intrinsic 
simulation may fail. These considerations will become clearer in the beginning of Section |IV| once illustrated with 
an example. Nevertheless they are exactly the ones which motivated the notion of strong intrinsic simulation. 

Definition 11 (Strong isometric coding) Consider q + E and q" + E" two alphabets with distinguished quiescent 
states q and q" , and such that \q + E| < \q" + E"|. Consider !K g +s and Jfy+E" the Hilbert spaces having these 
alphabets as their basis, and ^Ke q+S , 'Kq , i+s „ the Hilbert spaces of finite configurations over these alphabets. 
Let E,I be isometric linear maps from to l Jin' l +T," , where I preserves quiescence, i.e. is such that I\q) = \q"). 

Let L, R be isometric linear maps from 3~C®_£ S to 3~C®r +E ,/ • Given any finite interval J, these can be combined into an 
isometric linear map Enc J = (I®L(8)&)jE&)R^I) from Jfe, +E into IKe „ +E „, which we refer to as the isometric 
encoding for interval J. 

Let D be isometric linear map from l H q "+<£," to JC g+ s ® !Hqii+-£", in the sense that D\q") = \q) ® \q")- It trivially 
extends into an isometric linear map Dec — ((^) Z -D) from J£e „ +E „ into %Q q+Tj ® 3ie 9 „ +s „, which we refer to as a 
strong isometric decoding. 

The isometries L, E, R and D define a strong isometric coding if the following condition is satisfied: 

V|V> G JC e ^ +E , 3|0) € 5tev, +E „ / \^)®\4>) = Dec(Enc J \^)). 

In this scheme we now allow ourselves to know in advance whether the non-quiescent cells are all within the region 
J. Over the J region we apply our encoding ^ E as usual. Over left and right quiescent tails we do not do much, since 
(^) / sends | . . . qq . . .) to | . . . q"q" . . .) anyway. In the direct surroundings of the J region, we do something special, 
namely we apply L on the left and R on the right. Intuitively L/R will be used to work at 'encoding more left/right 
cells' as the configuration grows through the computation, leading to the following expected notion of strong direct 
simulation. 

Definition 12 (Strong direct simulation) Consider q + E and q" + E" two alphabets with distinguished quiescent 
states q and q" , and two QCA G and G" over these alphabets. We say that G" strongly directly simulates G, if and 
only if there exists a strong isometric coding such that for all interval J we have: 

V ? : g N, V|tf) G H e j +s , B\4>) G W eg „ +E „ / ® \4>) = Dec(G" 1 {Enc J \^))) . 

Again this is not quite enough for intrinsic simulation, since we want to say that G' simulates G even though the 
translation takes one cell of G into s cells of G', and one step of G into t steps of G' . But this time we want to 
make sure that the quiescent tails \ . . . qq . . .) are taken into quiescent tails | . . . q'q' . . .). So we restrict the notion of 
grouping. 

Definition 13 (Strong grouping) Let G' be a QCA over alphabet q' + E'. Let s and t be two integers, and E" = 
E' s \{(7 /S }. Consider the iterate global evolution G up to a grouping of each s adjacent cells into one supercell. If this 
operator can be considered to be a QCA G" over q' s + E", then we say that G" is an (s,t)- strong- grouping of G' . 

Definition 14 (Strong simulation) Consider g + E and q' + E' two alphabets with distinguished quiescent states q 
and q' , and two QCA G and G' over these alphabets. We say that G' strongly intrinsically simulates G if and only if 
there exists G" some strong grouping of G' such that G" strongly directly simulates G. 

Roughly we can say G' strongly intrinsically simulates G if and only if there exists some topology-preserving 
isometry Enc which translates each finite configuration of G into a finite configuration G", such that if we then run 
C'and translate back, the whole process is just equivalent to just a run of G. 

C. The structure of one-dimensional QCA 

As we have just seen one-dimensional QCA can be defined in quite general terms as a local, translation-invariant, 
global unitary evolutions G acting over a line of finite dimensional quantum system. But contrary to its classical 
counterpart the definition does not yield an immediate way of constructing / enumerating all of the instances of this 
model. So if we just stick to that one definition QCA remain excessively abstract, hard-to-grasp mathematical object. 
Fortunately Schumacher and Werner {22| , and later Arrighi et al. [3] in the context of finite unbounded configurations, 
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FIG. 4: Werner-style Margolus neighbourhood QCA [25]. The elementary unitary evolutions Uo and U\ are alternated repeat- 
edly as shown. The letters p, q, r, s with pq = rs denote the dimensions of the quantum systems. 
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FIG. 5: Partitioned QCA simulating a Werner-style Margolus neighbourhood QCA. The scattering unitary U is a (pq + 1) x 
(pq + 1) 2 matrix, essentially acting like either Uo or Ui on parts of the input according to the control qubit. The black box could 
be just any unitary, the lines upon which it acts are added just so that U takes subsystems of an equal number of dimensions 
left and right. 

have proved that any such mathematical object is structured according to Figure [4j i.e. that one-dimensional QCA 
admit a 2— layered unitary block representation. These elementary unitary evolutions Uq and U\ are well-understood 
objects, i.e. they can be just any finite unitaries, as implemented for instance by some circuit of universal quantum 
gates for instance. Cheung and Perez-Delgado have proposed a definition of QCA directly in terms of local unitary 
evolutions, but thanks to this theorem the two are equivalent in one-dimension (the situation is similar in n-dimensions, 
see jl]). This seems to be as good as it gets if we are not willing to modify the space upon which G is acting. However 
the notion we are interested in is intrinsic universality, and in this context we are quite willing to interleave some extra 
cells for the purpose of simulating one QCA by another. And so it is not difficult to see that any such [/of/i-structured 
QCA can be straightforwardly made into a t/£/-structured QCA, as made explicit by Figure [5] Hence for the purpose 
of this paper, can really just restrict our attention to (7J7-structured QCA as in Figure [T] These are also referred to 
as Partitioned QCA for instance in [27], by analogy with classical Partitioned CA. 

Definition 15 (Partitioned QCA) A partitioned one- dimensional quantum cellular automaton (PQCA) over 
3^e, +E is defined by a unitary U an operator U : ® 3~Cq+s — ► 3~C g+ £ <8> ^K q +s, such that U\qq) = \qq), i.e. 

one which we take two cells into two cells and preserves quiescence. Let G = ((^) z U) the operator over !Ke g+E ■ The 
induced global evolution is G at odd time steps, and o~G at even time steps, as in Fig. [7J 

The defining elementary unitary evolution U will be referred to as the scattering unitary, by analogy with quantum 
field theory. Hence we will talk about 't/-defined QCA' in order to designate the PQCA having scattering unitary U. 
In this paper we demonstrate how a particular /7-defined QCA can simulate any ^-defined QCA, that is for any V. 
But again let us insist that this restriction to PQCA is without loss of generality, since we have just explained that 
any QCA can be put into the form of a PQCA. Notice that in Fig. [4] two vertical lines were denoting two subcells 
making up one cell of the QCA. But in the definition of PQCAs, we have referred to each of these subcells as just 
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one cell. This was simply out of convenience, but one should keep in mind that one cell of a general QCA really 
corresponds to two cells of a PQCA. 

III. INTRINSIC UNIVERSALITY 

A. What do we need for universality? 

In subsection |II A| we have recalled the formal definition of One-dimensional QCA, and in subsection |II C| we have 
recalled they have a simple circuit-like structure. Ultimately the picture one needs to have in mind in order to follow 
this section is that a QCA is just that of Fig. [l] 

In subsection |II B| we have provided a formal definition for the notion of intrinsic simulation. But again the picture 
one needs to have in mind in order to follow this section just that of Fig. [2] 

The purpose of this section is to find a particular [/-defined QCA, and which is capable of intrinsically simulating any 
I^-defined QCA, whatever the V. In order to describe that [/-defined QCA we need to describe two things: 

- How its cells are like (i.e. what are the vertical lines of Fig. [I] made of, what is the dimensionality of [/?). By 
definition of QCA they are finite dimensional quantum systems of some fixed dimension d of course, but for clarity 
we will decompose these into subsystems of dimension di, and we will give names to these subsystems according to 
their purpose (i.e. the vertical lines of Fig. [I] are 'buses'). 

- How U acts upon a pair of these cells, and more precisely upon the subsystems making up the pair of cells. We will 
say this informally, but we will also provide a formal circuit descriptions of U, and check that such a U is indeed a 
unitary. 

Before continuing with our detailed discussion of the QCA, it is useful for the sake of clarity to make precise some of the 
vocabulary and conventions that will be encountered in the rest of the paper. By a subsystem we mean a constituent 
of a cell that serves a specific function in our QCA. A subsystem may take many different values. Subsystems have 
names written in bold. The term signal is used to refer to the value of a certain subsystem when it consistently 
travels between a cell and its left/right neighbour. In the space-time diagram of the QCA, a signal looks like a 'line' 
propagating through cells. Usually we take the name of a signal the name of the subsystem that takes those values. 
This section will be organized as follows: first we give an intuitive idea about the mechanism we used to solve this 
problem, then we will explain different components of the QCA in detail, and finally we will see how the whole system 
fits together. 

B. Intuition 

The cells of our universal, [/-defined, QCA will have a subsystem called data used for encoding one qubit of 
information about the state of a cell of the simulated, U-defined QCA. Hence one simulated cell (dotted oval in Fig. 
[6]) will in general be encoded as several adjacent simulating cells (small grey ovals in Fig. pp. We also need to simulate 
the action of some arbitrary unitary V, and so the cells of our universal QCA will have a subsystem called program, 
holding some description of one of the elementary universal quantum gates that make up V. Hence V (dotted box 
in Fig. [6]) will in general be encoded as several adjacent simulating cells (small black ovals in Fig. [6]). Because V 
originally acts upon two cells, it is encoded in the surroundings of the two encoded cells. 

During a first phase, as time unravels, the information held in the data subsystem of the encoded cells remain 
stationary, and so we can think of them as some stationary data signals. Meanwhile the information held in the 
program subsystems of the surroundings travels at lightspeed, and we can think of them as moving program signals. 
The program signals cross the data signals, leaving them unchanged, until they collide between one another. When 
that happens an elementary universal quantum gate is applied on the data subsystems (grey box in Fig. [6]), thereby 
implementing V. Which elementary universal quantum gate gets applied depends on the value of the colliding program 
signals. Where the elementary universal quantum gate gets applied depends on where the collision takes place. 
During a second phase, we need to 'reload' this situation, with the added difficulty that V gets applied in a shifted 
manner. Hence we need to arrange so that the left/right encoded cell travels left/right in order to meet with their 
right/left counterpart on the next site (travelling small grey ovals in Fig. |6j. 

We will explain how to do this in three steps. First we draw a background pattern which allow us to synchronize this 
whole process. Second we let some data signals flow upon this background. Third we let the program signals move 
upon the background, crossing and acting upon the data signals. Note that whenever we write \dgp) for the state of 
a cell, we mean that the subsystem data is in state \d), subsystem program is in state \g), subsystem mode is in 
state \p). 
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FIG. 6: Outline of the simulation scheme. 



C. Ternary background pattern 

First we seek to draw the ternary background pattern of Figure [7] where the large squares cycle through the three 
colours Light grey, Middle grey or Dark grey. The reason why this is useful will become clear in the next paragraph, 
for now it suffices to know that this ternary background pattern will help us synchronize the flow of the data signals 
as in Figure [6] and hence organize the computation as we add more interesting things to the initial configuration, i.e. 
the bottom of the diagram. 

In order to achieve this ternary background pattern each cell must contain a 3-dimcnsional system to code for 
those three different colours. This is really the purpose of subsystem mode: when the mode equals 0, 1 or 2 the 
background colour is Light grey, Middle grey or Dark grey respectively. We must then place some signals at regular 
intervals, travelling at lightspeed and telling mode signals that they must change colour, and this is really role played 
by state |1) of subsystem program. 

Let us show that the scattering unitary U which is given in Figure [12] does the job of generating Figure [7] Observe 
Figure [12] and notice that the content of the mode and program subsystems is always propagated unchanged by 
the scattering unitary U, to the right /left if it comes from the left/right. Moreover observe Figure [l] and notice that 
at the next layer the content of mode and program will again come up from the left/right and hence be propagated 
again to the right/left. Hence whatever value is in the mode or in the program subsystem it just travels at maximal 
speed, right or left, depending only upon its position in the initial configuration. This is just what we mean by 'a 



signal propagating at lightspeed'. In Subsection IIIF we provide all extra information needed about Figure 12 so 
that the behaviour of U becomes fully-determined. We then state that |1) in the program subsystem is the control 
value required for the +1 mod 3 to apply upon the mode subsystem. Hence the 'Change colour' signals are indeed 
implemented by setting some cells to have their subsystem program initialized at |1) as in Figure [7] and the 'Change 
colour' signals indeed propagates at lightspeed, changing the value of the mode signals travelling at lightspeed in the 
opposite direction. 

Notice that later, when we will set the data subsystem to non-|0) values in order to code for simulated cells, or use 
up the other possible values of the program subsystem in order to code for elementary universal gates to be applied 
upon the coded simulated cells, this ternary background pattern will remain unaffected. This is obvious from Figure 



12 and the fact that |1) is the only value of program which triggers the +1 mod 3 gate. 

The required widths of the Light grey and Middle grey zones of the initial configuration vary depending upon the 
V^-defined QCA we are seeking to simulate, in a way which we explain in Subsection |III D and HIE respectively. 



D. Hexagonal data signals flow 

Second we seek to draw the hexagonal data signals flow of Figure [9] That is we want to implement data signals, 
and would like that the data signals remain stationary for a while, and then separate into a left moving and a right 
moving bunch of data signals, only to eventually rejoin their left and right counterparts in order to flow straight in 
time again, as was explained in Figure [6] As previously mentioned, the reason why we want to achieve this particular 
form of data signals flow is that it corresponds to the overall architecture of the QCA we are trying to simulate as 
in Figure [2] with the interaction unitary V taking its inputs as coming both from the left and the right, computing 



10 




FIG. 7: Ternary background pattern. Each small coloured square now represents the state of a cell at time t. This is unlike in 
the previous figures, where boxes would represent a unitary evolution. At the bottom we have the initial configuration. Time 
flows upwards as usual. Each configuration is determined by the one below by pairing up the small squares of the configuration 
below and applying the scattering unitary U to these pairs. The way they are paired up alternates in time: for odd steps cell 
is paired with cell 1, cell 2 with cell 3, etc., whereas for even steps cell 1 is paired up with cell 2, cell 3 with cell 4, etc. The 
Light grey, Middle grey and Dark grey colours correspond to different values of the mode subsystem of the cells. They are 
separated by 'Change colour' signals, represented in Black. (With 000} in Light grey, 1 001} in Middle grey, |002) in Dark grey, 
and |?1?> in Black.) 




FIG. 8: Ternary background pattern. This is the same as Figure [7] except we no longer show the 'Change colour signals'. (With 
|??0) in Light grey, |??1) in Middle grey, |??2) in Dark grey.) 



upon them, and spitting its outputs again both towards the left and the right for another run of V. 
In order to achieve this each cell must contain another 3-dimensional subsystem to code for a data signal. This is 
really the purpose of subsystem data: when the data equals |0), |1) or |2) the cell carries no data, an encoded |1) 
or an encoded |2) respectively. We must then place the encoded data qubits in the Light grey coloured zones of the 
initial configuration, i.e. replacing the 1 000} cells by 1 100} cells in order to code for the presence of an encoded |0), 
and 1 000) by |200) in order to code for the presence of an encoded |1). 

Let us show that the scattering unitary U which is given in Figure [12] does the job of generating Figure [9] The 
intuitive explanation will of course be that the Grey levels of the ternary background pattern are here to tell the 
data whether it can move or not, with the Middle and Dark grey forcing it to remain stationary, and the Light grey 
allowing it to move freely until they are gathered by a Middle grey funnel again. The more formal explanation relies 
on looking at the scattering unitary matrix U which is given in Figure [12] in order to understand when the value of 
the left/right data subsystem is propagated to the right/left, and when it is just left sitting on the left/right. This is 
what determines whether a data signal is stationary or moving at lightspeed. In order to have a complete answer to 
this question one must look at the definition of the S gate in Figure 12 as provided in Subsection IIIF There we find 



11 




FIG. 9: Molding the hexagonal data flow pattern. The ternary background of Figure [5] is now used to tell the data signals 
when it is they should stay stationary, and when it is they should move to go and meet their counterparts. We show the data 
signal as White for an encoded |0) qubit and Black for an encoded |1) qubit. (With |0?0) in Light grey, |0?1) in Middle grey, 
|0?2) in Dark grey, |1??> in White, |2??} in Black.) 




FIG. 10: Hexagonal data flow pattern. This is the same as Figure [9] except we no longer show the ternary background. (With 
|0??> in Grey, |1??} in White, |2??} in Black.) 

that S swaps the left and right data subsystems but only if one of them is |0) (i.e. a data signal moves right/left only 
if there is no data signal there) and if the values of both mode subsystems are |0) (i.e. the Light grey zones of the 
ternary background pattern). Initially the data signals are in a Light grey zones, but they are stuck upon another, so 
they cannot move. Not even the ones on the left and right ends can move - due to the surrounding Middle grey zones. 
Clearly this stationary situation will be maintained until the ones on the left and right ends become surrounded by 
Light grey zones. During this period the data signals are freed two by two, with the one on the left end going to the 
left, and the one on the right end going to the right. Finally the first right moving data signal meets up with the 
first left moving data signal, and so they are stuck by one another for one step. But that one step is enough so that 
the second right/left moving data signal rejoins the first, so that the second one also gets stuck against the first, and 
the first does not bounce back. When all of the right/left moving signals have done so we are back to the original 
situation. Hence this ability of the ternary background pattern to 'mold' the flow of data signals into the hexagonal 
shape of Figure [9] 

Notice that the required width of the Light grey zone of Figure [7] has now become apparent. Since the purpose of this 
Light grey zone is to encode the state of two cells of the simulated QCA, it needs to be twice as big as the number of 
qubits that are needed to encode one cell of the simulated, V-defined QCA. 
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FIG. 11: Circuitry. The Middle grey initial zone of Figure 10 has been modified in order to include two j3)-valued program 
signals (here in Dark grey) and one |2}-valued program signal (here in Light grey). This was done by changing two adjacent 
|001) to 1 031) , and a |001) to |021). We can see that these program signals travel at lightspeed and sometimes collide. When 
they do so a two qubits elementary universal gate gets applied upon the data signals. The nature of the gate depends upon 
the values of the colliding program signals, whereas the position where the gate applies depends upon the place where the 
collision occurs. (Here whenever two |3)-valued program signals intersect a cNot gate get applied to the data signals below 
them, whereas whenever a |2)-valued program signal meets a |3)-valued program signal a I® Not gets applied. With |?3?) in 
Dark grey, |?2?) in Light grey, |20?) and |21?) in White, |20?) and |21?) in Black, and the rest in Grey.) 



E. Collision gates 



Third we seek simulate the scattering unitary V, as in Figure 11 As was explained in Figure [6] the key idea here 



is that during the time the data signals are stationary, they may be crossed by program signals incoming from both 
their left and their right, and sometimes these program signals even collide against one another upon the data signals. 
The value of the colliding program signals is what will specify what should happen to the data signals, through a 
numbering of a set elementary universal quantum gates. The relative positions of the program signals is what will 
specify where this should happen, by determining where they collide. 

In order to achieve this we must change some of the |0) values of the program subsystems of the Middle grey zones 
of the initial configuration, and allow them to take extra values |2) and |3). I.e. we will change some |00T) cells into 
|021) or 1 031) cells and generate program signals carrying value \2) or |3), which will then travel at lightspeed, collide 
and so implement gates upon the data signals. 

Let us show that the scattering unitary U which is given in Figure fl2] does the job of generating Figure [TT] We have 



already shown in Subsection III C that the program signals travel at lightspeed, unaffected. The only thing we need 



to explain is what happens when they collide with one another. Again from Figure [12] and Subsection |IIIF we have 



that whenever two program signals (xprogram and yprogram are both |2) or |3)) cross each other upon some data 
signals (xdata and ydata non-|0)) then some elementary quantum gate is applied upon xdata<g>ydata as given in 



Table |XTJ The mechanism is illustrated in Figure 1 1 but with essentially classical elementary gates - so that we may 
draw their effect. 

Notice that the required width of the Middle grey zone of Figure[7]is starting to become apparent. Since the purpose of 
this Middle grey zone is to hold pairs of program signals, each pair coding for one of the elementary universal quantum 
gate implementing V, its size will depend will depend upon the number of elementary universal quantum gates of the 
circuit-description of V. However the exact size of those Middle grey zones will be determined in Subsection |III G| 
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FIG. 12: The scattering unitary U as a quantum circuit. The three left/right lines represent the three subsystems making up 
the left /right cell. Each square represents an elementary quantum gate being applied of these subsystems, conditional to the 
value of the control systems designated by the horizontal lines. Time flows upwards. The whole thing represents the scattering 
unitary U, it takes two cells and yields back two cells. 



F. Fitting things together 

So overall our universal, (7-defined, QCA consists of a repeated application of one scattering unitary U as in Figure 
[l] Now follows the summarized description of the structure of the cells and the scattering unitary U. 
Structure of the cells In Figure [T] each vertical line do not represent just one qubit but a 36-dimensional quantum 
system made of the subsystems described in the following table. 



Name 


Size 


Function 


data 


3 


To hold one qubit of data of the QCA being simulated 
(|0) =Empty, |1) ^Encoded |0), \2) =Encoded |1)). 


program 


4 


To code for what quantum gate should be applied to the data 
(|0) =Empty, |1) =Change colour, |2)/|3) =cf. Table [it} 


mode 


3 


To synchronize the flow of the data signals (|0) =Light grey 
|1) =Middle grey, \2) =Dark grey). 



TABLE I: Subsystems of the copy band 



Structure of the scattering unitary U. In Figure [T] the scattering unitary U takes two inputs x and y, each of which 
decomposes into three subsystems xdata, xprogram, xmode and ydata, yprogram, yprogram respectively, as 
mentioned. Therefore it could be given as a 36 2 x 36 2 matrix of complex numbers, yet fortunately it decomposes as in 
Figure [T2| In this diagram horizontal lines are control lines. Let us explain what we mean by control lines in general, 
as this will provide us with the main step of the proof of the unitarity of U. Say that a box labelled B applies upon 
systems Si, . . . , S p whilst having an horizontal line crossing systems T\, . . . ,T q , then its effect is to apply the unitary 
evolution Bi whenever the control system is in state |i) - and linearly so. The following well-known lemma formalizes 
this construct. 

Lemma 1 (Unitarity of the control-construct) 

Let Si, . . . , S p and T±, . . . ,T q be quantum systems of finite dimensions \Si\, . . . , \S P \, \T±\, . . . , \T q \. Let {Bi} be a set 
of \Ti\ ■ ■ ■ \T q \ unitary matrices of dimension \Si\ ■ ■ ■ \S P \ x |Si| ■ • • \S P \. Then the evolution defined by linear extension 
of 

|^Ti<8...<8T„|,^Si®...<8£> |^Ti<8...8>T„_g i |^Si®...®S J , j s itself unitary. Here the notation \(f)) s means the quantum 
system S in in state \<f>). 

Proof. (This has appeared many times before in the literature but we reproduce it for completeness.) Since the 
evolution is a square matrix it suffices to check that it has orthonormal columns. 

Let {|i) r i®-® T 9 } and [\j) s i®-® s v] denote the canonical orthonormal basis for Ti ® . . . ® T q and Si ® . . . S p 
respectively. Then the inner product between column ij and column kl is given by 

«*l ® 01-Bi)(l*) ® B k \l)) = StkijlBjBill) = SikSji. □ 
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xprogram ® yprogram 

1 22) 
1 23) 
|32) 
133) 

otherwise 



Action of M 

Swap 
I(g> H 
H®I 
cPhase 



where cPhase stands for 



/ 1 
1 
1 



Hadamard on the second qubit 
Hadamard on the first qubit 



\ e !7r/s 
TABLE II: The M gate. 



This being said we are now in a position to describe the three gates used, in terms of whatever canonical basis state 
their control systems may take. Of course the control systems do not actually have to be in a basis state, by linear 
extension and the above lemma. We shall see that the first two gates are used really just to move the encoded data 
qubits around, whereas the last gate actually does perform the computation. 

• The +1 mod 3 gate. If xprogram^ |0) and yprogram= |4), then xmode is incremented by one modulo 
three. Else this is the identity. (Symmetrically so swapping the roles of x and y.) This step's role is to increment 
the modes, which in turn synchronizes the flow of the data signals. 

• The S gate. If xprogram= |0), and yprogram= |0), then permute 1 01) with 1 10) and 1 02) with |20). Else this 
is the identity. This means we are swapping xdata and ydata only if one of them is empty and the modes are 
'White'. This step's role is to move the data when appropriate, in order to actually perform the flow of data. 

• The M gate. If xdata^ |0), ydata^ |0), xprogram= |2) and yprogram= |2), then the system xdataOydata 

undergoes an elementary quantum gate according to the state of the system xmode®ymode as in Table [IT] 
This step's role is to apply a quantum gate upon two qubits of data, in order to perform the computation. 

In the next section we show how this all fits together to obtain the intrinsic universality. Beforehand however note 
that since we have described the evolution U as a combination of smaller unitary matrices via tensors, composition 



and the control-construct, it is indeed unitary as required by Def. 15 



G. Results 



Universal set of elementary quantum gates. First let us show that we have a universal set of gates available in the 
QCA. The set of gates which the M gate is able to perform upon the data qubits has been chosen to be universal in the 
traditional sense, i.e. any finite dimensional unitary evolution V can be approximated by tensors and compositions 
of these gates. We have not chosen the standard set (cNot, H, Phase) so as to preserve the xy symmetry of the 
unitary evolution U and yet keep the dimension of program to a minimum, but it is easy to see that we can recover 
the standard set since 

cNot\4>) = (I® H)(cPhase) 8 (l <g> H)\ip) 
|1) <g> Phase\ip) = cPhase\l) <g) \ip) 

where the ancilla |1) can be brought over via applications of the Swap gate. 

Addressing each elementary quantum gate. Second, let us show that we may combine our universal quantum gates in 
and arbitrary fashion so as to be able to implement any quantum circuit within the QCA. Consider some circuit C 
made of m elementary two qubit quantum gates ($ , i)j=o...ro— l taken from our universal set of two-qubit gates (Swap, 
I® if, H®I, cPhase). Suppose that the circuit is 2n qubits wide, so that the positions (pi)i=o... m -i telling us where 
to apply those gates can be given as numbers in — (n — 1) ... (n — 1) relative to the center. For instance if 93 = Swap 
and p 3 = 0, this means the fourth gate in the circuit consists in swapping the qubits at the center. Or say if go = I® if 
and po = n — 1, this means that the first gate of the circuit consists in applying a Hadamard to the rightmost qubit. 
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Hence a position actually refers to a pair of qubits, position — n + 1 being the leftmost pair of qubits, — n + 2 the 
second leftmost pair etc. This is coherent with the fact that we have only two qubits gates in our chose universal set 
of quantum gates. 

Then circuit as acting upon the initial state \ip) can be encoded in our QCA via the state: 

ro-l 

|r(c,v)> = ( (g) iOsfWst®*). 

i=m — 1 i—0 

Here the F\ip) region holds the qubits of the circuit, encoded within the cells of the QCA. The way to do this is 



as explained in Subsection HID i.e. |0)/|1) is encoded as 1 100) / 1 200) . (It is important that the presence of data 
|1??)/|2??) be distinguished from the absence of data |0??).) Formally (with \<j)) 3 ' the j th subsystem of |</>)): 

n-l 

F\i>) = (g)/nc|^'|00) 

3=0 

Inc\0) = \l) Inc\l) = \2). 

The {(S>i= m -t h) an d (®"=o lr ') n0 ^d the description of the circuit, encoded within the cells of the QCA. Say we 
want to perform the gate <7, at position pi of these 2n encoded qubits. We will surround the qubits (and the previous 
gate descriptions) with a description of (gi,Pi), as in: 

with for Pi < 0, k = loil)® 4 "- 1 - 2 ^! ® \0l( gi )l) <g> |001)® 2|Pl1 

r t = \0r( 9l )2) <g> I001)® 4 "- 1 
and for Pl > 0, k = I001)® 4 "- 1 <g> |(M(ft)l) 

r, = |001)® 2 I»I ® |0r(s0l) ® lOOl)® 4 "- 1 " 2 ^ 
moreover l(Swap) = r(Swap) = 1(1® H) = 2 

l(cPhase) = r(cPhase) = r(i? ® I) = 3 

This deserves some explanations. Say, as in the previous example, that we want for our fourth gate to have a Swap 
happen at position 0. We know from Table |TT| that in order for U to perform a Swap upon its data subsystems, 
we need both program registers to be in state |2). So we need to have a program signal coming from the left and 
holding value |2) = \l(Swap)) , and the same from the right. Moreover we need them to originate equally far from the 
center so that they meet above the center, and we need to make sure that the next gate gets encoded far away enough 
so that its program signals do not intersect ours - thereby inducing some unwanted operations. Finally the data 
signals nee d to b e sandwiched by Middle grey mode signals so that they remain stationary, very much as explained in 



Subsection [ill D| Here l 3 = lOOl)® 2 ™" 1 <g> |021) and r 3 = |021) <g> I001)® 2 "- 1 will do the job. The rest is only a matter 
of placing the program signals in a adequate manner so that the gate is performed in the right position. Note that in 
the left/right encoded circuit description region, only one cell out of two is potentially used, because the other cells 
can only move in the opposite direction which will not go over our data, see Figure |13| More convoluted encodings 
can be used to save up space and time, but only marginally. Also they seem to make the size of the encoding of a 
gate gi dependent upon its position p t , which makes it more difficult to explain. What we have here does formalize 
the way to address each elementary quantum gate with respect to the data qubits in our scheme. It shows that we 
are able to simulate any such circuit C by using up 8mn cells during Amn steps. 

Intrinsic simulation of any partitioned one- dimensional QCA. Third, let us show that this particular PQCA is in- 
trinsically universal. In order to do this, consider a PQCA has scattering unitary V, and say we seek to approximate 
it just as illustrated in Figure [2] In order to do so we must consider the quantum circuit C(V) which approximates 
V in terms of the quantum gates (gi)i=o... m -i acting at positions (pi)i=— n ...n-i- The approximated QCA as acting 
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L: 2n+2k n n R: 2n+2k 



FIG. 13: Encoding a gate g. Here the position of the gate relative to the center position is k = —1, meaning that the gate will 
be applied not on the center qubits but on the pair just left of it. And here the radius of the data signals bus is n = 3, meaning 
that three qubits are needed to code for just one cell of the simulated QCA. The two black lines represent the right-moving and 
left-moving program signals that make up the encoding of the gate g. The fact that the right-moving program signal originates 
further apart from the stationary data signals than its left counterpart is what codes for the position k = — 1. In the worst case 
scenario the position is — n+ 1. Clearly we must not send another left-moving signal until the worst case scenario right-moving 
signal has come out of the data signals, otherwise their collision may trigger the application of an unwanted gate. So each gate 
requires at least 2(n + k) cells on each side in order to be encoded. In the chosen encoding we systematically take An cells on 
each side in order to encode one gate. 



upon the initial state \<fi), can be encoded in our QCA via repetitions of the state \T(C, 4>,k)): 

\T(C, <p, k)) = |011) <g> w ® |011) ® F\<f>) 2k > 2k+1 
with for Pl < 0, h = lOll)® 2 "- 1 -!^! ® |0/( 5i )l) <g> 1011)®^ 

n = \0r( 9i )l) <8> I011)® 2 "- 1 
and for Pt > 0, h = lOll)® 2 ™" 1 <g> |0%)1) 

n = \0ll)®^ ® \0r( gi )l) ® loil)^ 2 "- 1 -^! 

?Tl-l 

moreover r = n, I = k 

i—0 i—m—1 

w = (£)l k ®r k 

k 



Here the scheme is exactly that of Fig. 11 Compared to the previous coding here is what deserves explanations. 
The data is encoded exactly in the same way, namely each pair of cells \<fi) k ' k+1 is encoded as 2n cells F\(f>) k,k+1 . 
Then the description of the circuit C is also encoded in the same way, in the surroundings of each encoding cells. 
There are a couple of differences however. Previously we were using only one out of two cells in the left/right encoded 
circuit description region, but now we can interleave the left and right encoded circuit descriptions to use them all, 
this is what w does and the reason why the factor of 2 was dropped in the h/vi descriptions. Also, we now need 
to handle mode and 'Change colour' signals appropriately so as to obtain the ternary background pattern and the 
hexagonal data signals flow. By inspection of Fig. [9] we see that this is done by setting the left/right encoded circuit 
description regions to having subsystems mode at 'Middle grey' and separating them from the encoded pair of cells 
with a 'Change colour' signal (the |011) cells). 

Again this is not quite optimal, but it does entail the following result. 

Theorem 1 There exists G' a U -defined QCA which is intrinsically universal QCA in the following sense. Let G be 
V -defined QCA such that V can be expressed as a quantum circuit C made of m gates acting upon 2n qubits. Then G' 
is able to intrinsically simulate G with space expansion factor s — Anm + 2 + In and time expansion factor t — (3/2) s. 

Proof. Each cell of the ^-defined QCA will be encoded into a string of cells of the [/-defined QCA which we have 
described in this Section, according to the formula \T(C, <j), k)) given in this Subsection (remember that that the two 
cells 2k and 2k + 1 of a PQCA are really just the k cell of the corresponding QCA in the original definition, as we 



explained in Subsection II C I. This constitutes our isometric encoding E. The size of |r(C, ^,fc)) can be seen to be 
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Anm + 2 + 2n, which explains the value of the space expansion factor s. The ratio between the time expansion factor 



and the space expansion factor is deduced by inspection of Figure 1 1 



□ Note that if the scattering unitary V is only approximated with an error of e = maxi,M ||V|^>) — C(V) by 
the quantum circuit C(V), then this entails we are able to intrinsically approximate the evolution of s cells over t 
steps with an error bounded by ste - again using supercells of size s — Anm + 2 + 2n and a time expansion of factor 
t = (3/2)s. This is the general statement that errors in quantum circuits grow no more than proportional to time 
and space [IB], which stems from the fact that if \\U - U'\ \ < e then \\U® S - U'® s \ \ < se and \\U l - U n \ \ < te. 



IV. STRONG INTRINSIC UNIVERSALITY 



The idea of tackling the problem of strong intrinsic universality was suggested to us by Torsten Franz, whom we 
would like to thank. 



A. Why do we need another flavour? 



The QCA explained in subsection IIIF has one problem: one needs to be able to prepare an infinite initial 
configuration somehow. If we do not do that, then the quiescent cells surrounding the background pattern will slowly 
begin to mix with the pattern, thus rendering it unusable. As can be seen in Fig. [14J after certain amount of time, 
the whole ternary background pattern will be reduced to a single cell, and no more computation is possible. Thus, in 
order to perform an arbitrary long computation, we have to know the length of the computation beforehand, so that 
we can prepare an initial configuration of adequate width and protect the ternary background pattern long enough 
for the computation to finish. But in practice finding out how long the computation is going to last is a very difficult 
task, and may even be undecidable. In order to solve this problem, we need a second flavour to the notion of intrinsic 



universality, namely strong intrinsic universality, as was formally presented and defined in Subsection II B Hence we 
shall build a new QCA that has the ability to weave the ternary background pattern as the computation proceeds. 
This section will be organized as follows: first we give an intuitive idea about the mechanisms we use to solve this 
problem, then we explain the different components of the QCA in detail, and finally we explain how the whole system 
fits together. 




FIG. 14: The ternary background pattern will shrink as the computation goes on and the surrounding quiescent cells enter the 
computation region. 
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B. Intuition 



In John Conway's 'Game of Life' CA, there are objects called 'guns' that can fire cells in different directions at 
different time intervals. Here we use a similar idea in order to fire the signals that make up the ternary background 
pattern. Essentially, we use what we call a 'copy band', i.e. a structure made out of two walls and a synchronization 



signal bouncing between them, as in Fig. 15 The distance between the walls controls the frequency of duplication, 
and the bouncer signal decides when to fire a piece of signal constituting the ternary background. Recall that this 
background was essentially made out of two signals: the mode (the mode subsystem is described in subsection III F I , 
and the program (each program signal carries half of the information needed to code for a universal quantum gate, 
refer back to Table [IT] to see how each gate is encoded as the state of two program subsystems). The left/right 
half of the coding of a universal quantum gate will hence be held by a left/right copy band. Since the fired mode 
and program signals travel at maximum speed, they appear on the space-time diagram of the QCA as having a 
45/135-degree angle with respect to the horizontal line. So the two program signals each encoding one half of the 
program will eventually meet in the middle, inducing the application of a quantum gate. The program signals are 
being duplicated at a constant rate, thus creating the 'squares' in Fig. [JJ Part of the difficulty is to determine what 
level of grey should be sent (the value of the mode signal). This will be explained further down. The speed of the 
computation, manifested in Fig. [7] as the size of the 'squares', can be controlled by adjusting the distance between 
the walls and hence the frequency of the bounces. 



C. The copy band 



We continue with the convention described in Subsection |III A[ i.e. the names of the subsystems will be given in 
bold (Examples of subsystems are the mode and program subsystems in Table III I , and names of the signals will 
not (An example of a signal is the bouncer signal in Fig. 15). 

A copy band is shown in Fig. 15 It is composed of a left wall, a right wall and the bouncer signal traveling in 
between. The subsystems used in order to implement the copy band are summarized in Table |III| 




FIG. 15: A band. Both walls of the band contain a counter signal, and the left wall also contains a copy of the program signal 
which will be fired later in order to make up the ternary background pattern. The bouncer signal can be seen bouncing between 
the walls, with different grey level corresponding to different values of this signal. The duplicated program signal propagates 
to the right at light speed. Time flows upwards. 



The program signal that encodes half the information of a quantum gate is kept in the outer wall. This wall also 
stores the counter, which is used to slow down the speed of the walls by counting from 1 to 3 and then letting the 
walls move right/left only when the value of the counter signal is 3, the meaning that there is no counter signal. 
The bouncer signal is placed next to this wall in the initial configuration. When the computation begins, the bouncer 
signal will travel towards the opposite wall at maximum speed. When it hits the outer wall, its value will change. 
The value of the bouncer signal is essentially another counter that cycles from 1 to 3, the meaning that there is no 
bouncer signal. Hence each change of value is just the sequence of operation — 1, (+1 mod 3), +1, applied only when 
the value differs from 0. For simplicity we refer to this operation as Inc 3 . Whenever the bouncer signal hits the outer 
wall, the program/mode signals that constitute the background pattern will be fired. Namely, a copy of the value of 
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FIG. 16: Two copy bands placed side by side. The band will not alter a program signal that is already traveling at light speed, 
so the program signals emitted by the band to the left just pass through the band to the right. Time flows upwards. 



the copy subsystem (cf. Table IV I will be placed in the program subsystem, and the value of the mode subsystem 
will be set appropriately. The way this is done involves the value of the counter and the history subsystems, and 
will be explained in detail in subsection |IVD| 

Since each copy band only holds half the information needed to code a quantum gate in the ternary background 
pattern, another band is needed in order to reconstruct the gate. By exploiting the symmetry of the QCA depicted 
in subsection |III F[ we use two symmetrically placed bands to 'weave' one gate in the ternary background pattern, as 
shown in Fig. |17| 



D. Mode synchronization 



In subsection |IIIF| all gates start in the same mode at time 0, and the mode cycles each time a 'change colour' 
signal is encountered. Since program signals start and travel at the same speed towards each other, it takes them 
an equal amount of time to meet in the middle, meeting the same number of 'change colour' signals on the way, 
thus making mode synchronization automatic. In the QCA described in this section, however, gates start in different 
modes at different times, and although they still travel at the same speed towards each other, they may very well 
meet different numbers of 'change colour' signals on the way, so when two program signals meet, their mode may be 
out of sync, which may wrongly render this gate inactive/active. To address the problem, we used the information 
stored in the history subsystem and value of the bouncer subsystem in order to compute the correct value for this 



Name 


Size 


Function 


counter 


4 


The counter guarantees that the ternary background pattern grows at a constant 
rate. It increments by one at each step, and whenever it gets to 3, the copy band 
will start to shift left (or symmetrically, right ) to transform one quiescent cell 
into part of the ternary background pattern. 


wall 


3 


Walls are what mark the border of a 'copy band', separates it from the quiescent 
cells and the ternary background pattern. Cells that are not part of any copy 
band have value |0) (i.e. quiescent cells, the ternary background pattern and data 
cells). The walls that are closer to the half-line of quiescent cells have value |1), 
they are referred to from now on as 'outer walls'. The right walls in the left half 
of the QCA have value |2), they are referred to as 'inner walls' from now on. 


bouncer 


4 


The bouncer is a signal that travels inside the copy band, and changes its value 
whenever it hits a wall of value Every time the bouncer bounces on the outer 
wall, a copy operation is performed on the copy subsystem. The resulting copy 
is put in the program subsystem, which travels at maximum speed to construct 
gates in the ternary background pattern. Together with the history subsystem, 
it enables calculation of the correct mode value for the fired ternary background 
signal. 


history 


3 


To calculate the correct mode of the duplicated program bit, a 'history' of how 
many mode-change signals have been encountered by this copy band is indispens- 
able. Together with the bouncer subsystem, the correct mode can then be in- 
ferred. 



TABLE III: Subsystems of the copy band 
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FIG. 17: Universal quantum gates encoded by two symmetrically placed copy bands. Gates are formed when two duplicated 
programs traveling in opposite directions intersect. Seven gates can be seen in the graph. 



mode signal. 

The bouncer signal decides when to fire the program signal and in which mode. It has three possible values, each 
corresponding to a value of the fired mode signal. The bouncer signal cycles through these values each time it 'touches' 
the outer wall (i.e. wall of value |1)), at this time the program stored in the outer wall will also be duplicated and 
fired as a signal, with the current value of the bouncer signal as its default mode, so to say. 

The history subsystem stores the number of 'change colour' signals that a program signal 'would have encountered' 
if the signal had been sent at time 0. The effects of these 'hypothetical' signals cannot be ignored, we must keep 
track of them modulo 3. So actually the value of the fired mode will be that of the bouncer *plus* the offset kept by 
history. This mechanism is illustrated in Figure [T8| 




Legends: 

Colour change Signal Copy BanO with History 

Program Signal Copy Band with History 1 

'Hypothetical' Colour change Signal Copy Band with History 2 



FIG. 18: Synchronization of the mode of the fired ternary background signal. The initial value of the history subsystem should 
be the number of 'change colour' signals the copy band would have been encountered had the ternary background pattern been 
complete (i.e. had the 'hypothetical' signals actually been sent). In the depicted situation, the value of the history subsystem 
of the outer copy bands would be 1 because the outer bands would have encountered a 'change colour' signal if the region 
occupied by the copy bands had been part of the initial configuration. 
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E. Fitting things together 

1. Components and circuit overview 

Tabic |TV| summarizes the subsystems that make up a cell. In addition to the ones already mentioned in Table [I] and 
|III| there is also the copy subsystem, which is used to store the value of the program signal to be fired. 



Name 


Size 


Function 


counter 


3 


cf. Tabic [TTT| 


wall 


3 


cf. Tabic [TIT] 


bouncer 


4 


cf. Tabic [TTT] 


history 


3 


cf. Tabic [TTT] 


data 


3 


cf. Tabic [I] 


copy 


4 


The system that carries half the information needed to construct a gate. This 
information is permanently stored in a copy band, and will be duplicated when a 
ternary background signal needs be fired. It has the same size as the program 
subsystem of subsection |III Fj 


program 


4 


cf. Tabic [I] 


mode 


3 


cf. Tabic [T] 



TABLE IV: Subsystems of the QCA 



The circuit of the QCA has a similar structure to the circuit depicted in Fig. |12| albeit being more complex. To help 
visualize the circuit, it is divided into several different 'layers', with each layer performing one or two basic functions. 
A schematic diagram of the different layers is given in Fig. [19] We will layout the circuit diagrams for the different 
layers and explain their functions in detail below. We follow the same conventions in the detailed diagrams as in 
Fig. [12] (with horizontal lines representing control lines, and different subsystems represented by different vertical 
lines, time flowing upwards. The xs denote subsystems of the left cell, and the ys denote subsystems of the right cell.) 



X' Y' 

1 L 



Displacement and Interaction 








Program Duplication 








Mode Computation 








Copy Band Expansion 


3 




( 



FIG. 19: The schematic layer diagram of the QCA's circuit. The X and Y lines denote the left and right cells undergoing the 
transformation. Time flows upwards. 



2. Layers of the QCA 's circuit 



• The Copy Band Expansion Layer (Fig. 20 1: This layer 'expands' the ternary background two new cells at a 
time (one cell one the left, one cell on the right) by shifting the walls of the copy bands outwards, converting 
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quiescent cells into part of the background pattern. Because the copy and the history subsystems are stored 
in the walls, they follow the expansion of counter and wall subsystems. The counter subsystem controls the 
swapping of copy, history and wall subsystems. That is, the swap gates are applied whenever the value of 
xcounter or ycounter is |3). The SC gate performs the function of swapping the value of xcounter and 
ycounter whenever one of them is |3). It is unitary because it is essentially just a permutation of the different 
base vectors of the subsystems upon which it acts. Its permutation table is given in Table [Vj This layer also 
lets the bouncer signal travel and bounce, i.e. the values of xbouncer and ybouncer are swapped whenever 
the value of xwall and ywall are both |0) (i.e. the bouncer signal is traveling between the walls). Depending 
on the position of the copy band (i.e. the band is at the left or right side of the ternary background), only one 
Inez gate will be applied. This gate is to cycle the value of the bouncer signal when the value of xwall or ywall 
is |1>. 



xcopy' ycopy' xhistory' yhistory' xcounter' ycounter' xwall' ywall' xbouncer' ybouncer' 



SC 



SWAP 



SWAP 



SWAP 



Ino 



Inc- 



Inc: 



InC' 



SWAP 



xcopy ycopy xhistory yhistory xcounter ycounter xwall ywall xbouncer ybouncer 

FIG. 20: The copy band expansion layer. 



The SC gate 


Input 


Output 


|30) 


<03| 


|03) 


<30| 


131} 


<13| 


|13) 


<31| 


132) 


<23| 


|23) 


<32| 







TABLE V: The SC gate. 



The Mode Computation Layer (Fig. 21 1: this layer implements the rest of the synchronization mechanism 



discussed in subsection |IVD| Here both the counter and the history subsystems must be used to determine 
the correct mode of the fired program signal. The bouncer subsystem contains the 'base mode' of the program 
signal. We call it 'base mode' because in order to compute the correct mode, we must 'offset' the base mode by 
the value of the history subsystem. The function of the CM gate is to 'offset' the base mode by the value of 
the history subsystem, giving a new mode and leaving the history subsystem unchanged. To understand this 
one must know that even though the ternary background signal is fired at regular time intervals by the outer 
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wall of a copy band, its mode is only worked out when it crosses the inner wall and leaves out the copy band. 
Hence the CM gate is only applied when the value of xwall is |0), the value of ywall is |2), to make sure that 
the gate is applied at the inner wall, and when there is a non-empty xprogram (to make sure that a ternary 
background signal is crossing this inner wall). The CM gate shown in the figure only acts on half of the copy 
bands, namely the copy bands that fire program signals to the right. Because the QCA is symmetric, the CM 
gate that acts on the other half can be obtained by replacing every x in the figure with y. Another function 
played by this layer is the incrementation of the history. Recall that the value of history is incremented when 
the inner wall that holds the history subsystem meets a program signal with value |4) (cf. the program 
subsystem in Table [I]) . But the inner wall may also meet the program signal fired by the outer wall of the same 
copy band. To differentiate these two program signals, we can use the bouncer signal: the program signal fired 
by the outer wall always travel with the bouncer signal, which is non-zero, until it acquires its mode and leaves 
the copy band. On the other hand, the program signal coming from the 'outside' of the copy band does not have 
a bouncer signal traveling with it. Contrary to the CM gate, the +1 mod 3 gate shown in the figure acts on 
copy bands which fire program signals to the left. The +1 mod 3 gate is applied when the value of xprogram 
is |4), and the value of xbouncer is not zero (to make sure the 'change colour' signal comes from the outside of 
the copy band), and the value of ywall is |2) (to make sure we are applying the gate to the yhistory subsystem 
in the inner wall). 

xprogram' xbouncer' xwall' ywall' xmode' yhistory' 

















+1 mod 3 


















CM: 

(xmode+((xbouncer+yhistory) mod 
3+1))) mod 3 ® yhistory 





























xprogram xbouncer xwall ywall xmode yhistory 



FIG. 21: Left half of the mode computation layer. 



The Program Duplication Layer (Fig. 22 1: the function of the CP gate in this layer is to change the value of 



program subsystem to the value stored in copy subsystem whenever a bouncer signal meets the outer wall 
that stores the copy subsystem. It is also responsible for incrementing the value of the bouncer signal. This 
figure also shows only the left half of the actual diagram. The CP gate is applied when the value of xwall is 
|0), the value of ywall is |1) (to make sure the gate is applied to the outer wall), and a non-zero xbouncer (to 
make sure that the bouncer signal has reached the outer wall). 

• The Displacement and Interaction Layer: this layer is identical to the scattering unitary U of the weak intrinsic 
universal QCA described in subsection IIIF (cf. Fig. 12 1. It only acts on the three subsystems given in Table I 
while leaving the other subsystems unchanged. It is responsible for moving the duplicated program and data 
around, and performing the computation by having the reconstructed gates act on the data. 



3. Initial state preparation 



We now turn our attention towards the question of how to set up the initial state of out strong intrinsic QCA in 
order to simulate some ^-defined QCA. This will be very much alike what we did in Subsection III But for the QCA 
described in this section we must also set up the copy bands, and special attention must be paid to the initialization 
of the mode synchronization mechanisms. 

Determining the speed of computation. When setting up our strong intrinsic universal QCA to simulate a QCA 
having scattering unitary V, the first task is to determine the size s of the supercells used for the encoding. This then 
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xwall' ywall' xprogram' xcopy' xbouncer' 













CP: 

((xprogram+xcopy) mod 4)®xcopy 













xwall ywall xprogram xcopy xbouncer 

FIG. 22: Left half of the program duplication layer. 



determines how many steps t of the simulating QCA are needed to simulate one step of the simulated QCA. Refer 
back to Subsection III Bl for a formal definition of this s and t. 



This step is very much identical to that of Subsection III G Again the key elements in order to work out s are the 



sizes of the quantum circuit description C of the scattering unitary V, namely 2n the number of qubits upon which 
C acts and m the number of elementary universal quantum gates which make up C . Because each simulated cell will 
be encoded into the simulating QCA via just the same isometric coding E (with all of the extra subsystems left in 
|0)), we have s = Anm + 2 + 2n and t — (3/2) s once more. 

But then the number, width and the separation of the copy bands needs be worked out too. The number of copy 
bands is easily found: since two copy bands are necessary to encode one gate, there are Aran + 2 {Aran for program 
signals and 2 for change colour signals) copy bands in total. These Amn + 2 copy bands are placed symmetrically at 
the left and right side of the data, with each side containing 2mn + 1 copy bands. 

The rate at which the ternary background signals are fired is equal to the distance traveled by the bouncer signal 



from the inner wall to the outer wall (refer to Figure 15 for a visualization). We want this rate to be s/2, because 



there are s/2 ternary background signals to be fired on each side. Let us call d the initial value of the width of each 
band (the distance between the inner wall and the outer wall, excluding walls). Clearly this is what determines the 
rate at which the ternary background signals are fired, yet this rate is not just d, because in traveling this distance, 
the bouncer signal has to 'catch up' with the outer wall which is moving at speed one third, and hence 

s j lj lj lj 1 , 3 , 

- = d + -d + -d-\ d+ ...= ^-r~:d = -d 

2 3 9 27 1 - (1/3) 2 

so d = a/3 = ~(4mn + 2 + 2n) 
3 

Here the fact that may not be an integer arises from the flexibility of the initial configuration, namely the flexibility 
in choosing initial values of the counter subsystem. As a result, it is possible to find two values of d, one equals 
to and the other one equals to [-Is] , by adjusting the initial values of the counter subsystem in the inner and 
outer walls, that produce the same result (i.e. correspond to the same value of s). Note however, the choice of d and 
the initial values of the counter subsystem is not arbitrary, because the current neighborhood scheme of the QCA 
may render the walls moving in the wrong direction. If all the counters start in the same value, we can fix the value 
of d by round |s to the closest integer. The separation between different copy bands just needs to be done by leaving 
one quiescent cell in between them, so that the proximity of each others wall do not interfere with their functioning. 
Now by multiplying the widths and separations of the bands together with the number of bands, we obtain the size 
of the r parameter of the strong isometric coding, as was formally defined Subsection |IIB| In our scheme we have: 

r = (-s + 2 + 1) * (2mn+ 1) = (-(4mn + 2 + 2n) + 3) * (2nm + 1) 



Notice a nice feature of our construction: our copy bands move apart at speed 1/3, but the expansion rate of the 
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simulated QCA is at most s/2t, which is also 1/3. Hence the ternary background pattern, which is being constantly 
'weaved' by our copy bands, grows fast enough so that the data it holds never 'leak' out of it. This point is illustrated 
in Figure [53] 

Prepare the initial bouncer and history value. The next step would be to figure out the initial bouncer value, and 
the corresponding history value. The best way to do this is to use a fixed bouncer value for all copy bands, and 
use the technique of subsection |IV D| to trace the 'hypothetical' change colour signals, and calculate the appropriate 
history value for each band by hand. 

A complete example of the strong intrinsic universal QCA is shown in Fig. |23| It has four program bits, hence four 
copy bands at each side. The bands are numbered 1 to 8 from left to right, and their initial configurations are shown 
in Table |VI| This figure does not show the actual interaction between gates and data, so data just 'flow through' the 
ternary background pattern. 



Band 


Configuration 


1 


|30130000), |00001300), |00000000), |00000000), |30200001) 


2 


130130000), |00001300), |00000000), |00000000), |30200421) 


3 


|30130000), |00001300), |00000000), |00000000), |30200330) 


4 


|30110420), |00003400), |00000000), |00000000), |30200322) 


5 


|30200322), 100000000), |00000000), |00003400), |30110410) 


6 


|30200310), 100000000), |00000000), |00001300). |30130000) 


7 


|30200411), 100000000), |00000000), |00001300), |30130000) 


8 


|30200001), 100000000), |00000000), |00001300), |30130000) 



TABLE VI: Initial configurations of each copy band. The subsystems in each cell is ordered by 

| counter \ data \ wal I \ copy \ bouncer\program \ mode \ history) 




FIG. 23: A complete example of the QCA. The ternary background pattern is produced by the four pairs of copy bands. Two 
data bits are shown as two white lines propagating through the ternary background pattern. In this example no quantum gates 
act on the data bits. Time flows upwards. 



F. Formal results 



The strong intrinsic universal QCA shares many formal properties with the weak one. The way that universal 
quantum gates are constructed and addressed, as described in subsection IIIG| can be applied directly to the strong 
intrinsic universal QCA. The additional structures, namely the copy bands, correspond to the L and R isometries in 
Definition [TT] With the construction details described in this section, we have shown the following theorem: 
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Theorem 2 There exists G a U -defined QCA which is strongly intrinsically universal QCA in the following sense. 
Let G be V -defined QCA such that V can be expressed as a quantum circuit C made of m gates acting upon 2n qubits. 
Then G is able to strongly intrinsically simulate G with space expansion factor s — Anm + 2 + 2n, time expansion 
factor t = (3/2)s. Moreover if the initial configuration we want to simulate is of size x, then the initial configuration 
of the simulating configuration is of size sx + 2r with r = (|(4mn + 2 + 2n) + 3) * (2nm + 1). 

Proof. The E encoding is the same as the one in Subsection |III G and the L and R are the constructions of the let 

and right 'copy bands' described in this section. Their different sizes have been worked out previously. 

□ 



V. CONCLUSION 



Main claim and future work. We have formalized the notion of a QCA capable of simulating all others with linear 
overhead, exactly if the scattering unitaries they are made of decompose into a circuit of elementary quantum gates, 
and approximately otherwise. We have constructed such an intrinsic universal QCA, which turns out to be a Par- 
titioned QCA (Figure [IJ of cell-dimension 36 and whose scattering unitary we have given explicitly (Figure 12 and 
Subsection IIIF). If we insist that the initial configuration of the simulating QCA be finite (and not just periodic) 
we get to a stronger notion of universality, which was also formalized here. We have also constructed such a strong 
intrinsic universal QCA, which turns out to be a Partitioned QCA (Figur e [T| o f cell-dimension 15552 and whose 



scattering unitary we have given explicitly (Figures 19 to 22 and Subsection 



IVE) 



Clearly the main challenge we now face is to find an intrinsically universal QCA in n > 1 dimensions. The con- 
struction described in this paper is unlikely to be useful then, as it seems to blow up in complexity. But surprising 
as it may seem, in the classical reversible cellular automata literature intrinsically universal n-dimcnsional reversible 
cellular automata turn out have appeared before the intrinsically one-dimensional reversible cellular automata [18 . 
The common belief in the classical CA community seems to be that the n-dimensional constructions are actually 
simpler, due to the ability to use up the more than one-dimensional space to draw up circuits and sort out wire 
cross-overs. Intuitively at least, in n-dimensions intrinsic universality is very much the same as circuit universality. 
This is the direction which we plan to take in the close future. 
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